Saturated Transformers are Constant-Depth Threshold Circuits

نویسندگان

چکیده

Abstract Transformers have become a standard neural network architecture for many NLP problems, motivating theoretical analysis of their power in terms formal languages. Recent work has shown that transformers with hard attention are quite limited (Hahn, 2020), as they can be simulated by constant-depth AND/OR circuits (Hao et al., 2022). However, is strong assumption, which may complicate the relevance these results practice. In this work, we analyze circuit complexity saturated attention: generalization more closely captures patterns learnable practical transformers. We first show transcend known limitations hard-attention then prove floating-point values threshold circuits, giving class TC0 an upper bound on languages recognize.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Uniform constant-depth threshold circuits for division and iterated multiplication

It has been known since the mid-1980's [15, 46, 47] that integer division can be performed by poly-time uniform constant-depth circuits of Majority gates; equivalently, the division problem lies in P-uniform TC 0. Recently this was improved to L-uniform TC 0 [19], but it remained unknown whether division can be performed by DLOGTIME-uniform TC 0 circuits. The DLOGTIME uniformity condition is re...

متن کامل

A Bounded Arithmetic Theory for Constant Depth Threshold Circuits

We deene an extension R 0 2 of the bounded arithmetic theory R 0 2 and show that the class of functions b 1-deenable in R 0 2 coincides with the computational complexity class TC 0 of functions computable by polynomial size, constant depth threshold circuits.

متن کامل

On Small Depth Threshold Circuits

In this talk we will consider various classes defined by small depth polynomial size circuits which contain threshold gates and parity gates. We will describe various inclusions between many classes defined in this way and also classes whose definitions rely upon spectral properties of Boolean functions.

متن کامل

Linear-Size Constant-Depth Polylog-Treshold Circuits

We present a simple explicit construction giving unbounded fan-in circuits with $o(n)$ gates and depth $O(r)$ for the threshold function of $n$ variables when the threshold is at most $(log n)^r$, for any integer $r>0$. This improves a result of Atjai and Ben-Or, who showed the existence of circuits of size $n^{O(1)}$. This is the highest threshold for which polynomial-size, constant-depth circ...

متن کامل

Pseudorandomness for Read-Once, Constant-Depth Circuits

For Boolean functions computed by read-once, depth-D circuits with unbounded fan-in over the de Morgan basis, we present an explicit pseudorandom generator with seed length Õ(log n). The previous best seed length known for this model was Õ(log n), obtained by Trevisan and Xue (CCC ‘13 ) for all of AC (not just read-once). Our work makes use of Fourier analytic techniques for pseudorandomness in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the Association for Computational Linguistics

سال: 2022

ISSN: ['2307-387X']

DOI: https://doi.org/10.1162/tacl_a_00493